
[HF][streaming][1/n] Text Summarization #851

Merged
merged 1 commit into from
Jan 10, 2024

Conversation

rossdanlm
Contributor

@rossdanlm rossdanlm commented Jan 10, 2024

[HF][streaming][1/n] Text Summarization

TSIA

Adding streaming functionality to text summarization model parser

Test Plan

Rebase onto 11ace0a and test it there.

Follow the README from AIConfig Editor https://github.com/lastmile-ai/aiconfig/tree/main/python/src/aiconfig/editor#dev, then run these commands:

```bash
aiconfig_path=/Users/rossdancraig/Projects/aiconfig/cookbooks/Gradio/huggingface.aiconfig.json
parsers_path=/Users/rossdancraig/Projects/aiconfig/cookbooks/Gradio/hf_model_parsers.py
alias aiconfig="python3 -m 'aiconfig.scripts.aiconfig_cli'"
aiconfig edit --aiconfig-path=$aiconfig_path --server-port=8080 --server-mode=debug_servers --parsers-module-path=$parsers_path
```

Then, in AIConfig Editor, run the prompt (it will use streaming format by default).

Demo video (Screen.Recording.2024-01-10.at.01.16.45.mov): https://github.com/lastmile-ai/aiconfig/assets/151060367/e91a1d8b-a3e9-459c-9eb1-2d8e5ec58e73

Stack created with Sapling. Best reviewed with ReviewStack.

Comment on lines +136 to +137
```python
new_text = new_text.replace("</s>", "")
new_text = new_text.replace("<s>", "")
```
Contributor

Do these just signify the start and end of the stream?

Contributor Author

Not quite! I'm really not sure why they show up for a few models while TextGeneration doesn't need this.

I saw output like `</s><s>Streaming Text</s>`, so I just removed the tokens to keep the output clean.
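A minimal sketch of this cleanup (not the PR's exact code; the function name is illustrative): some HF tokenizers leave sentence-boundary tokens like `<s>` and `</s>` in decoded streaming chunks, and a single regex substitution strips both in one pass.

```python
import re

# Matches both the opening <s> and closing </s> sentinel tokens.
SENTINEL_TOKENS = re.compile(r"</?s>")

def clean_chunk(new_text: str) -> str:
    """Remove sentence-boundary tokens from a decoded streaming chunk."""
    return SENTINEL_TOKENS.sub("", new_text)
```

This is behaviorally equivalent to the two chained `str.replace` calls in the diff.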

Comment on lines +253 to +255
```python
should_stream = (options.stream if options else False) and (
    not "stream" in completion_data or completion_data.get("stream") != False
)
```
Contributor

This logic is very complicated. Ideally I'd prefer to break it out into if/else branches or a helper function, but it's not ship-blocking if it works. I'm just struggling to think through how it behaves.

For example, if no options are specified, stream will be false, even if there is a stream key in completion_data. Is that the correct behavior?

Contributor Author

> if no options is specified, stream will be false, even if there is stream in completion_data. Is that correct behavior?

That is correct! The reasoning is that stream isn't really a model setting; it's an inference setting that requires a stream callback to be passed in, so streaming doesn't make sense without one (where would you stream out to?).

Sure, I made a task to split this into a helper function: #861

@saqadri saqadri merged commit 074b768 into main Jan 10, 2024
saqadri added a commit that referenced this pull request Jan 10, 2024
[HF][5/n] Image2Text: Allow base64 inputs for images

Before, we didn't allow base64 inputs, only URIs (either local paths or http/https URLs).
This is good because our text2Image model parser outputs in base64
format, so this will allow us to chain model prompts!

## Test Plan

Rebase and test on
0d7ae2b.

Follow the README from AIConfig Editor
https://github.com/lastmile-ai/aiconfig/tree/main/python/src/aiconfig/editor#dev,
then run these commands:
```bash
aiconfig_path=/Users/rossdancraig/Projects/aiconfig/cookbooks/Gradio/huggingface.aiconfig.json
parsers_path=/Users/rossdancraig/Projects/aiconfig/cookbooks/Gradio/hf_model_parsers.py
alias aiconfig="python3 -m 'aiconfig.scripts.aiconfig_cli'"
aiconfig edit --aiconfig-path=$aiconfig_path --server-port=8080 --server-mode=debug_servers --parsers-module-path=$parsers_path
```

Then, in AIConfig Editor, run the prompt (streaming is not supported, so I just
took screenshots).

These are the images I tested (with bear being in base64 format)

![fox_in_forest](https://github.com/lastmile-ai/aiconfig/assets/151060367/ca7d1723-9e12-4cc8-9d8d-41fa9f466919)

![bear-eating-honey](https://github.com/lastmile-ai/aiconfig/assets/151060367/a947d89e-c02a-4c64-8183-ff1c85802859)

<img width="1281" alt="Screenshot 2024-01-10 at 04 57 44"
src="https://github.com/lastmile-ai/aiconfig/assets/151060367/ea60cbc5-e6ab-4bf2-82e7-17f3182fdc5c">

---
Stack created with [Sapling](https://sapling-scm.com). Best reviewed
with
[ReviewStack](https://reviewstack.dev/lastmile-ai/aiconfig/pull/856).
* __->__ #856
* #855
* #854
* #853
* #851
@rossdanlm rossdanlm deleted the pr851 branch January 10, 2024 18:32